Cursive Character Challenge: a New Database for Machine Learning and Pattern Recognition
نویسندگان
چکیده
Cursive character recognition is a challenging task due to high variability and intrinsic ambiguity of cursive letters. This paper presents C-Cube (Cursive Character Challenge), a new public-domain cursive character database. C-Cube contains 57293 cursive characters manually extracted from cursive handwritten words, including both upper and lower case versions of each letter. The database can be downoloaded from the web and it provides predefined experimental protocols in order to compare rigorously the results obtained by different researchers.
منابع مشابه
Cursive character recognition by learning vector quantization
This paper presents a cursive character recognizer embedded in an o-line cursive script recognition system. The recognizer is composed of two modules: the ®rst one is a feature extractor, the second one a learning vector quantizer. The selected feature set was compared to Zernike polynomials using the same classi®er. Experiments are reported on a database of about 49,000 isolated characters.
متن کاملA SVM-based cursive character recognizer
This paper presents a cursive character recognizer, a crucial module in any cursive word recognition system based on a segmentation and recognition approach. The character classification is achieved by using support vector machines (SVMs) and a neural gas. The neural gas is used to verify whether lower and upper case version of a certain letter can be joined in a single class or not. Once this ...
متن کاملA Robust Free Size OCR for Omni-Font Persian/Arabic Printed Document Using Combined MLP/SVM
Optical character recognition of cursive scripts present a number of challenging problems in both segmentation and recognition processes and this attracts many researches in the field of machine learning. This paper presents a novel approach based on a combination of MLP and SVM to design a trainable OCR for Persian/Arabic cursive documents. The implementation results on a comprehensive databas...
متن کاملHandwritten Bangla Digit Recognition Using Deep Learning
In spite of the advances in pattern recognition technology, Handwritten Bangla Character Recognition (HBCR) (such as alpha-numeric and special characters) remains largely unsolved due to the presence of many perplexing characters and excessive cursive in Bangla handwriting. Even the best existing recognizers do not lead to satisfactory performance for practical applications. To improve the perf...
متن کاملCombining neural gas and learning vector quantization for cursive character recognition
This paper presents a cursive character recognizer, a crucial module in any Cursive Script Recognition system based on a segmentation and recognition approach. The character classi2cation is achieved by combining the use of neural gas (NG) and learning vector quantization (LVQ). NG is used to verify whether lower and upper case version of a certain letter can be joined in a single class or not....
متن کامل